15 research outputs found

    Quantifying Social Biases Using Templates is Unreliable

    Full text link
    Recently, there has been an increase in efforts to understand how large language models (LLMs) propagate and amplify social biases. Several works have utilized templates for fairness evaluation, which allow researchers to quantify social biases in the absence of test sets with protected attribute labels. While template evaluation can be a convenient and helpful diagnostic tool to understand model deficiencies, it often uses a simplistic and limited set of templates. In this paper, we study whether bias measurements are sensitive to the choice of templates used for benchmarking. Specifically, we investigate the instability of bias measurements by manually modifying templates proposed in previous works in a semantically-preserving manner and measuring bias across these modifications. We find that bias values and resulting conclusions vary considerably across template modifications on four tasks, ranging from an 81% reduction (NLI) to a 162% increase (MLM) in (task-specific) bias measurements. Our results indicate that quantifying fairness in LLMs, as done in current practice, can be brittle and needs to be approached with more care and caution

    The Bias Amplification Paradox in Text-to-Image Generation

    Full text link
    Bias amplification is a phenomenon in which models increase imbalances present in the training data. In this paper, we study bias amplification in the text-to-image domain using Stable Diffusion by comparing gender ratios in training vs. generated images. We find that the model appears to amplify gender-occupation biases found in the training data (LAION). However, we discover that amplification can largely be attributed to discrepancies between training captions and model prompts. For example, an inherent difference is that captions from the training data often contain explicit gender information while the prompts we use do not, which leads to a distribution shift and consequently impacts bias measures. Once we account for various distributional differences between texts used for training and generation, we observe that amplification decreases considerably. Our findings illustrate the challenges of comparing biases in models and the data they are trained on, and highlight confounding factors that contribute to bias amplification

    The 16th Data Release of the Sloan Digital Sky Surveys: First Release from the APOGEE-2 Southern Survey and Full Release of eBOSS Spectra

    Get PDF
    This paper documents the 16th data release (DR16) from the Sloan Digital Sky Surveys (SDSS), the fourth and penultimate from the fourth phase (SDSS-IV). This is the first release of data from the Southern Hemisphere survey of the Apache Point Observatory Galactic Evolution Experiment 2 (APOGEE-2); new data from APOGEE-2 North are also included. DR16 is also notable as the final data release for the main cosmological program of the Extended Baryon Oscillation Spectroscopic Survey (eBOSS), and all raw and reduced spectra from that project are released here. DR16 also includes all the data from the Time Domain Spectroscopic Survey and new data from the SPectroscopic IDentification of ERosita Survey programs, both of which were co-observed on eBOSS plates. DR16 has no new data from the Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey (or the MaNGA Stellar Library "MaStar"). We also preview future SDSS-V operations (due to start in 2020), and summarize plans for the final SDSS-IV data release (DR17)

    The 16th Data Release of the Sloan Digital Sky Surveys : First Release from the APOGEE-2 Southern Survey and Full Release of eBOSS Spectra

    Get PDF
    This paper documents the 16th data release (DR16) from the Sloan Digital Sky Surveys (SDSS), the fourth and penultimate from the fourth phase (SDSS-IV). This is the first release of data from the Southern Hemisphere survey of the Apache Point Observatory Galactic Evolution Experiment 2 (APOGEE-2); new data from APOGEE-2 North are also included. DR16 is also notable as the final data release for the main cosmological program of the Extended Baryon Oscillation Spectroscopic Survey (eBOSS), and all raw and reduced spectra from that project are released here. DR16 also includes all the data from the Time Domain Spectroscopic Survey and new data from the SPectroscopic IDentification of ERosita Survey programs, both of which were co-observed on eBOSS plates. DR16 has no new data from the Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey (or the MaNGA Stellar Library "MaStar"). We also preview future SDSS-V operations (due to start in 2020), and summarize plans for the final SDSS-IV data release (DR17).Peer reviewe

    The Seventeenth Data Release of the Sloan Digital Sky Surveys: Complete Release of MaNGA, MaStar and APOGEE-2 Data

    Get PDF
    This paper documents the seventeenth data release (DR17) from the Sloan Digital Sky Surveys; the fifth and final release from the fourth phase (SDSS-IV). DR17 contains the complete release of the Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey, which reached its goal of surveying over 10,000 nearby galaxies. The complete release of the MaNGA Stellar Library (MaStar) accompanies this data, providing observations of almost 30,000 stars through the MaNGA instrument during bright time. DR17 also contains the complete release of the Apache Point Observatory Galactic Evolution Experiment 2 (APOGEE-2) survey which publicly releases infra-red spectra of over 650,000 stars. The main sample from the Extended Baryon Oscillation Spectroscopic Survey (eBOSS), as well as the sub-survey Time Domain Spectroscopic Survey (TDSS) data were fully released in DR16. New single-fiber optical spectroscopy released in DR17 is from the SPectroscipic IDentification of ERosita Survey (SPIDERS) sub-survey and the eBOSS-RM program. Along with the primary data sets, DR17 includes 25 new or updated Value Added Catalogs (VACs). This paper concludes the release of SDSS-IV survey data. SDSS continues into its fifth phase with observations already underway for the Milky Way Mapper (MWM), Local Volume Mapper (LVM) and Black Hole Mapper (BHM) surveys

    A nationwide study of adults admitted to hospital with diabetic ketoacidosis or hyperosmolar hyperglycaemic state and COVID‐19

    Get PDF
    AimsTo investigate characteristics of people hospitalized with coronavirus-disease-2019 (COVID-19) and diabetic ketoacidosis (DKA) or hyperosmolar hyperglycaemic state (HHS), and to identify risk factors for mortality and intensive care admission.Materials and methodsRetrospective cohort study with anonymized data from the Association of British Clinical Diabetologists nationwide audit of hospital admissions with COVID-19 and diabetes, from start of pandemic to November 2021. The primary outcome was inpatient mortality. DKA and HHS were adjudicated against national criteria. Age-adjusted odds ratios were calculated using logistic regression.ResultsIn total, 85 confirmed DKA cases, and 20 HHS, occurred among 4073 people (211 type 1 diabetes, 3748 type 2 diabetes, 114 unknown type) hospitalized with COVID-19. Mean (SD) age was 60 (18.2) years in DKA and 74 (11.8) years in HHS (p < .001). A higher proportion of patients with HHS than with DKA were of non-White ethnicity (71.4% vs 39.0% p = .038). Mortality in DKA was 36.8% (n = 57) and 3.8% (n = 26) in type 2 and type 1 diabetes respectively. Among people with type 2 diabetes and DKA, mortality was lower in insulin users compared with non-users [21.4% vs. 52.2%; age-adjusted odds ratio 0.13 (95% CI 0.03-0.60)]. Crude mortality was lower in DKA than HHS (25.9% vs. 65.0%, p = .001) and in statin users versus non-users (36.4% vs. 100%; p = .035) but these were not statistically significant after age adjustment.ConclusionsHospitalization with COVID-19 and adjudicated DKA is four times more common than HHS but both associate with substantial mortality. There is a strong association of previous insulin therapy with survival in type 2 diabetes-associated DKA

    A nanotube/polymer composite biosensing thin-film transistor platform for C-reactive protein detection

    No full text
    In the present study, a back-gate thin-film transistor (TFT) based on a single-walled carbon nanotubes (SWCNTs)/poly(9,9-dioctylfluorene-co-bithiophene) (F8T2) composite semiconductor is proposed as electronic label-free immunosensor for the detection of the C-reactive protein (CRP). The nano-composite semiconducting ultra-thin film is deposited on the channel area of the TFT and is functionalized with a monoclonal anti-CRP antibody coating acting as biorecognition layer. The immunesensor was then used for the selective detection of the CRP both in phosphate buffered saline (PBS) solution and in human serum samples. The proposed immunosensor exhibits high current (10 - 20 ÎŒA) and good field-effect carriers mobility. Furthermore, the CRP could be revealed at a detection limit as low as 1 mg/L and in a wide dynamic range (from 0.4 nM to 2.2 ÎŒM)

    Solvent-gated thin-film-transistors

    No full text
    Electrical double layer (EDL) thin film transistors (TFTs) are an interesting class of transistors that use an electrolyte as the gating medium. Recently it has been demonstrated that pure organic solvents can also be used as gating media for TFTs without the addition of exogenous electrolytes. Here we present a systematic study of the performances of TFTs based on two different semiconductors (P3HT and ZnO) gated through nine different solvents either pure or loaded with NaCl. The nature of the solvent impacts the transfer characteristics of the TFT through a change in the gating capacitance while the threshold voltage remains unaffected. Depending on the polarity of solvents, addition of NaCl gives rise to different responses. TFTs gated through highly polar solvents are unaffected by the salt concentration while for low polarity solvents the output current increases with salt up to a plateau. Furthermore, when the semiconductor surface is covered with a high capacitance thin dielectric layer, the TFT output current becomes dependent on the NaCl concentration also for high polarity solvents. This phenomenology was rationalized considering the different contributions of Helmholtz and Guy-Chapman EDLs to the capacitance and the dielectric saturation that decreases the solvent dielectric constant within the Helmholtz EDL
    corecore